Automatic pronunciation evaluation and classification

نویسندگان

  • Om Deshmukh
  • Sachindra Joshi
  • Ashish Verma
چکیده

Pronunciation evaluation is an important module of every spoken language evaluation system. Automatic evaluation of quality of pronunciation that can mimic the performance of human assessors is a difficult task as human assessment accounts for several nuances of pronunciation including vowel substitutions and quality of consonants. This paper presents a novel approach that combines the knowledge of human assessment and the knowledge of the behaviour of automatic speech recognition systems to develop features for pronunciation evaluation. Instead of presenting the correlation of the proposed features with human assessment, the paper presents sentence-level classification accuracies which can directly be used in real-life applications. Inter-human and intra-human agreements, which are indicative of human subjectivity, are also presented. The trends in confusions among humans scores and automatic scores are compared as the number of classification classes is varied.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phonetic Distance Based Accent Classifier to Identify Pronunciation Variants and Oov Words

The state-of-the-art Automatic Speech Recognition (ASR) systems lack the ability to identify spoken words if they have non-standard pronunciations. In this paper, we present a new classification algorithm to identify pronunciation variants. It uses Dynamic Phone Warping (DPW) technique to compute the pronunciation-by-pronunciation phonetic distance and a threshold critical distance criterion fo...

متن کامل

Pronunciation Verification for Automatic Litera

Arguably the most important part of automatically assessing a new reader’s literacy is in verifying his pronunciation of read-aloud target words. But the pronunciation evaluation task is especially difficult in children, non-native speakers, and pre-literates. Traditional likelihood ratio thresholding methods do not generalize easily, and even expert human evaluators do not always agree on what...

متن کامل

Automatic evaluation of quantity contrast in non-native Norwegian speech

Computer assisted language learning (CAPT) has been shown to be effective for learning non-natives pronunciation details of a new language. No automatic pronunciation evaluation system exists for non-native Norwegian. We present initial experiments on the Norwegian quantity contrast between short and long vowels. A database of native and non-native speakers was recorded for training and test re...

متن کامل

Pronunciation verification of children²s speech for automatic literacy assessment

Arguably the most important part of automatically assessing a new reader’s literacy is in verifying his pronunciation of read-aloud target words. But the pronunciation evaluation task is especially difficult in children, non-native speakers, and pre-literates. Traditional likelihood ratio thresholding methods do not generalize easily, and even expert human evaluators do not always agree on what...

متن کامل

Auditory and Dynamic Modeling Paradigms to Detect L2 Mispronunciations

This paper expands our previous work on automatic pronunciation error detection that exploits knowledge from psychoacoustic auditory models. The new system has two additional important features, i.e., auditory and acoustic processing of the temporal cues of the speech signal, and classification feedback from a trained linear dynamic model. We also perform a pronunciation analysis by considering...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008